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Abstract 

We consider the change-point problem for the marginal distribution of subordinated 
Gaussian processes that exhibit long-range dependence. The asymptotic distributions of 
Kolmogorov-Smirnov- and Cramer-von Mises type statistics are investigated under local 
alternatives. By doing so we are able to compute the asymptotic relative efficiency of the 
mentioned tests and the CUSUM test. In the special case of a mean-shift in Gaussian data 
it is always 1. Moreover our theory covers the scenario where the Hermite rank of the 
underlying process changes. 

In a small simulation study we show that the theoretical findings carry over to the finite 
sample performance of the tests.. 

Keywords: asymptotic relative efficiency, change-point test, empirical process, local al¬ 
ternatives, long-range dependence. 


1 Introduction 


Over the last two decades various authors have studied the change-point problem under long- 
range dependence and classical methods are often fonn d to yield different re s ults t han nnder 
short-range dependence. The CUSUM test is studied inlCsorgo and HorvathI (119971 1 and com¬ 
pared to the Wilcoxon change-point test in Dehling et al. ( 2012 1. Ihine ( 2007 1 investigates 
a Darling-Erdos-type resnlt for a parametric chan g e-poin t tes t, and estimators for the time 


of change are considered in iHorvath and Kokoszkal (|l997l l and 


Hariz et al. 


Horvath and Shan ( 

19991 are 

time series and 

Berkes et al. 


j2009h More over, 


()200(tI 1 and iBaek and Pinirasl (j201lll construct tests in order to 


discriminate between stationary long memory observations and short memory sequences with 
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Kokoszka and Leious ( 

2001 

) and the associated chapter in 

Beran et al. 

(2010) 


One of the classical change-point problems is the change of the marginal distributions of a time 
series When testing for at most one change-point (AMOC) in the marginal distribution 

one often considers the empirical distribution function of the first k observations and that of 
the remaining observations. Taking a distance between the empirical distributions and the 
maximum over all A: < n yields a natural statistic. Common distances are the supremum 
norm, which gives the Kolmogorov-Smirnov statistic 


Tn = max sup 

l<fc<n3,g]a 


El 


k 


{Yi<x} 


- Vi 

n ^ 


{Yi<x} 


2 = 1 2=1 

or an L^-distance, which gives the Cramer-von Mises statistic 


( 1 . 1 ) 


Sn = max 
l<A:<r2 



k 


{Yi<x} 


n 


^ ^ ^{Yi<x} 1 dFji(x). 


( 1 . 2 ) 


i=l 


Both are widely used for goodnes s-of-fit tests a nd tw o-sample problems. In the chang e -point 


literature they are considered by ISzvszkowica (11994 ) for i ndependent data, by 
for strongly mixing sequences and by 


Giraitis et al 


Inoue 


(|200l|) 


(jl996bl ) for linear long-memory processes. 


However, note that in the LRD setting only the Kolmogorov-Smirnov test has been investi¬ 
gated. 

(HH) and (jl.2p are functionals of the sequential empirical process, that is Yl\=i 
for t G [0,1] and x G R. Thus the asymptotic distributions of and Sn rely on that of the 
sequential empirical process. For weakly dependent sequences this would be a Gaussian process, 
in the special case of independent random variables it i s called Kiefer-Muller proce ss. For 
stationary sequences that exhibit long-range dependence, iDehling and Taaaul (|l989al l proved 
that the limit process is of the form {J{x)Z{t)}t^x, where J{x) is a deterministic function 
and the process is therefore called semi-degenerate. They considered subordinated Gaussian 
processes, in detail Yi = G{Xi) for any measurable function G and a Gaussian sequence W 
with non-summable autocovariance function. A similar limit structure was later obtained 


Giraitis et al. 


(jl996al l for long-range dependent 


independently by IHo and Hsind (jl99fil l and 
moving-average sequences. 

It is the main goal of this paper to derive the limit distribution of change-point statistics of 
the type dni) and (II.2p under local alternatives. We then apply these results to derive the 
asymptotic relative efficiency (ARE) of several change-point tests. To this end we investigate 
the sequence 


Gi(Ai), . . . , Gi(Afc*), Gn{Xk* + u ) • • • Gn(Xn), 


(1.3) 
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Here Gn is a sequence of functions such that the distribution of Gn{Xi) converges to the 
distribution of G{Xi) in some suitable way. 

Therefore, we are able to analyze various types of change-points, among them a mean-shift. 
Thus we may compute the ARE of Kolmogorov Smirnov, CramCT-von Mises, CUSUM and 
Wilcoxon test and get the surprising result that in case of Gaussian data it is always 1. 

The mathematically most challenging case is the situation when the Hermite rank changes. 
The Hermite rank of the class {l{G(.)<a;} “ -^(3^)}3;eR is defined as the smallest positive in¬ 
teger, such that E\li^Q(^Xi)<x}Hq{Xi)\ ^ 0 for some x € R, with Hq being the g-th Hermite 
polynomial. The structure of the limiting process Z(t), e.g. the marginal distribution and 
the covariance structure, mainly depends on m. However, a special feature of distributional 
changes in subordinated Gaussian processes is the fact that the Hermite rank may change, too. 
Hence the question arises which Hermite process will determine the limit distribution. Under 
a mean-shift the Hermite rank remains unchanged, whi ch can be seen easily b y its definition. 


Our results differ in various ways from those obtained in 


Giraitis et al 


(|l996bl l. where changes 


in the coefficients of an LRD linear process were investigated. While the empirical process of 
LDR moving average sequences converges to fractional Brownian motion, we may encounter 
higher order Hermite processes. The possible change in the Hermite rank is therefore a novel 
feature in our investigation. 

The rest of the paper is organized as follows. In section [2] we will state a limit theorem 
for the sequential empirical process under change-point alternatives. Moreover we will give 
the asymptotic distribution of the test statistics under the hypothesis of no change as well as 
under local alternatives. Thus we are able to derive the asymptotic relative efficiency of several 
change-point tests. In section [23] we consider the empirical process for long-range dependent 
arrays that are stationary within rows. The outcome mainly serves as a device for proving the 
main results, but is also of interest on its own. Section [3| contains the simulation study. To the 
best of our knowledge there are no results on the finite sample performance of the Cramer-von 
Mises change-point test under long memory. It is compared to other change-point tests and 
the effect of an estimated Hurst-coefficient is discussed. We obtain that the theoretical results 
(e.g. asymptotic relativ efficiency between CramCT-von Mises and GUSUM test) carry over to 
the finite sample performance of the tests. Finally proofs are provided in section 0] 


2 Main results 

Let {Xi}i->i be a stationary Gaussian process, with 

EXi = 0, EXf = 1 and p{k) = EX^Xk = k-^L{k), 

for 0 < Z) < 1 and a slowly varying function L. The non-summability of the covariance function 
is one possibility to define long-range dependence. We investigate our results for so called 
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subordinated Gaussian processes {Yi}i>i, where Yi = G{Xi) and G: R —)• R is a measurable 
function. The key tool in our analysis of possible changes in the marginal distribution of such 
a process is the sequential empirical process. To obtain weak convergence of this process the 
right normalization is given by dn^m, defined by 


dl,m = Var ~ n^^L^{n), 


( 2 . 1 ) 


V. i=l 


where the constant of proportionality is 2m!(1 — mD) ^{2 — mD) see Theorem 3.1 in iTaaau 
(j 19751 1 . H = 1 — mD j2 is called Hurst coefficient and 


m = min {g > 0 | E[l{G{Xi)<x}Hq{Xi)] / 0 for some x] , 


i s the Hermite rank of {l{G(-)<a;} “ The mentioned result of iDehling and Taaau 

(Il989al l then reads as follows. 


Theorem A (Dehling, Taqqu). Let the class of functions {l{G(')<a;} “have Hermite 
rank m and let 0 < D < 1/m. Then 


■ 

’ 0 — 


2=1 


( 2 . 2 ) 


where the convergence takes place in I1([0,1] x [— 00 , 00 ]), equipped with the uniform topology. 
Jm{x) is defined by 


Jm{x) — E[l^G{Xi)<x}ddm{Xi)] 


and iZm,H{t))t£[o,i] 'Is o,n m-th order Hermite process, see Taaau tl97t\) for a definition. 


Remark 2.1. In the case m = 1, the Hermite process becomes the well known fractional 
Brownian Motion, which we denote by Bnit). 


2.1 The empirical process under change-point alternatives 

Let us consider the following change-point model. Define the triangular array 


Y ■ — 

^ n,i — 


G{Xi), ifz<[nrj, 
Gn{Xi), if i > [nrj -b 1, 


(2.3) 


for measurable functions G and {Gn)n and unknown r € (0,1). For r = 0 one gets a row-wise 
stati onary triangular array, as co nsidered in section 12.51 and for r = 1 a stationary sequence. 


as m 


Dehling and Taaaul (jl989al L In what follows we will denote the distribution functions of 
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G{Xi) and by F and -F(n)) respectively. 

To obtain weak convergence of the empirical process of (I2.8p we have to make some assump¬ 
tions on the structure of the change and the Hermite rank. 


Assumption A: 

Al. The class of functions has Hermite rank m with 0 < D < 1/m. 

A2. Let m{n) be the Hermite rank of {l{Gn(-)<x}}xGR and m* = liminf^^co m(n). Then we 
assume 


^(m-m*)D(i+5)/2 sup(P(min{G(Ai), Gn(Xi)} < x) - P(max{G(Ai),Gn(A:i)} < x)) 0, 


for some (5 > 0. 


Theorem 1. If Assumption A holds, then 


lnt\ 


P{Yn,i F x)) 


V 


J n 


2 = 1 


m\ 


~ Xyyi}J (f) , 


where Jm{x) is the Hermite coefficient of l^cQ<x}- Phe convergence takes place in T>([0,1] x 
[— 00 , 00 ]), equipped with the uniform topology. 

Remark 2.2. (i) For given functions G{x) and Gn{x), Assumption A2 might easily being 
checked, see the examples below. It serves to ensure convergence of the Hermite coefficients 
Jq,n{x) = E[l{Gr,iXi)<x}Hq{^i)]- hi detail. 


sup(P(min{G(Ai),Gn(Ai)} < x) - P(max{G(Ai), Gn(Xi)} < x)) 0 


implies, see the proof of Lemma 14.51 


sup] Jq^„(x) — Jg(x)| —)■ 0 Vg G IN. (2-4) 

By Assumption Al, Ji(x) = ... Jm-iix) = 0 for all x G R, yet Jm{x) 7 ^ 0 for some x. Together 
with ()2.4p this implies m* = liminf„_,.oo m(n) < m. 

(ii) Moreover, A2 implies convergence of the marginal distribution function. To see this, note 

\Pin)ix) - T(x)| = max{F(„)(x),F(x)} - min{F(„)(x),F(x)} 

< P(min{G(X),Gn(A:)} < x) - P(max{G(A), G„(X)} < x) 

and — 0(1). However, the converse is not always true. Consider for instance 

the functions G{x) = x and Gn{x) = Gi(x) = —x or the situation in ExamDle l2.8l Then again, 
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there are lots of natural choices of G and Gn for whom convergence of the marginal distribution 
functions (with a certain rate) implies Assumption A2. Among them Gn{x) = G{x) + 
(mean-shift), Gn{x) = anG{x) (change in variance) and 


Gn{x) = o ^{x) and G{x) = F ^ o $(x) 


-1 


(hi) Our assumptions explicitly allow for the Hermite rank to change together with the marginal 
distribution. Then again, the limit behaivior seems to be untouched by this change. Intuitively 
this corresponds to the idea that the change in distribution and the change in the Hermite 
coefficient, both caused by the difference of G and Gn, are of the same order. For q < m this 
enforces the function Jq^n{x) to converge rather fast to 0. Technically this can be explained 
through A2. If this assumption is dropped, we might actually encounter limits with multiple 
Hermite processes. Such cases will be considered in Example 12.81 and Corollary 12.131 
(iv) If A1 is viol ated, the sequence {C(Ai)),;>i is actually short-range dependent. For station¬ 
ary observations ICsorgd and MielniczukI ( 19961 1 showed convergence of the sequential empirical 
process to a two-parameter Gaussian process. Change-point alternatives have not been consid¬ 
ered for such random variables, yet, but would require fundamentally different proofs compared 
to our results. 


2.2 Asymptotic behavior of the change-point statistics 

We now apply the results concerning empirical processes to determine the asymptotic distri¬ 
bution of the Kolmogorov-Smirnov statistics 


7-1 


Tn = sup sup d'n^n^ 
te[o,i] xgR 


[ntj 

El 

i=l 


[nt\ ^ 

{y„,i<x} ^ 2^ ^{yn,i<x} 


2=1 


(2.5) 


and that of the Cramer-von Mises change-point statistic 


Sn = dnL sup 


tG[0,l] Jk 


[nt\ 

El 

2 = 1 


{yn,i<x} 


[nt\ 


n 


El 

2=1 


{y„,i<x} 


dFnix). 


( 2 . 6 ) 


To get a non degenerate limit under a sequence of local alternatives it is important to choose 
the right amount of change. For a mean-shift this is naturally the difference of the expectations 
before and after the change. For a general change we formulate the test problem as follows: 
We wish to test the hypothesis 


H : Assumption A1 holds and Gn{x) = G{x) for all x G E, and n > 1, 
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against the sequence of local alternatives 


An : Assumption A holds and, for n —)• oo, 

71 

- -(F(x)-F(n)(x)) (2.7) 

uniformly in x, where g{x) is a measurable function of bounded 
total variation, whose support has positive Lebesgue measure. 


Remark 2.3. Note that nd^\^ ~ '"/^(n). Thus (12.7D implies 

n("*)^(i+'^)/2(T(x) - F(n)(x)) ^ 0, 


for 6 < m*j(m — m*) or m* = m. This again implies Assumption A2 for certain choices of 
functions G and see Remark 12.21 fiii. 

Theorem 2. (i) Under the hypothesis H of no change we have, as n ^ oo, 


and Sri 


V 


V 


sup| Jm(x)/(m!)| sup 
xeR ie[o,i] 


/ iJmix)/{m\))‘^ dF{x) sup Zm,H{t) 

/xgR t€[0,l] 


where Zm,H{t) = Zm,H{t) -tZm,H{l)- 

(a) Under the sequence of local alternatives An we have, as n ^ oo. 


and 


Tn 


V 


sup sup 
xGRte[o,i] 


dm {x)/ {m\)Zm,H{t) - g{x)fjr(t) 


Sn 


V 


sup 
tG[0,l] ■ 


a::€lR 


Jm {x)/(ml)Zm,H{t) - gix)^;^ 


dF{x), 


where 


ifrit) 


i(l-T), ift<T, 
ift>T. 


Motivated by this Theorem we consider change-point tests based on the statistics Tn and Sn- 
Critical values might be chosen as 


sup|and / {Jm{x)/{m\)f dF{x)ql_rr^rn,H^ 

for the Kolmogorov-Smirnov test and the Cramer-von Mises test, respectively. Here qi-a,ra,h 
is the (1 — a)-quantile of sup^gjg,!] I Zm,H (t) \ ■ Thereby the tests have asymptotically level a and 
nontrivial power against local alternatives. 
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The tests can be performed, if the right normalization for the empirical process, the supremum 
of Jm{x) and the distribution of sup^gjQ (t)| are kn own. In prac tical applications this 


might be not the case. Solution s are se l f-norm alization 
Hurst -coefficient (see for example 


(|shaol ^^ )). 


estimating t he the 


Kiinsch 


(|l987 H and bootstrap estimators for Jm{x) (Tewes 


2.3 Examples 


Example 2.4 (Mean-shift). Let Gn{x) = G{x) + Hn with /i„ ~ d„/n, then we get the typical 
change in the mean problem. In the case of long-range dependent subordinated Gaussian 
proces ses t his was conside red in iDehling et al.l (j2012l . l2013l i , ICsorgd and HorvathI (|l997l l , IShao 


(1201 il l and iBetkenI (j2016l L Let fc be the probability density of G(Xi), and assume that it is 
continuous and of bounded variation. Then we obtain 


71 71 

- -(F(x) - T(„)(x)) = -- {F{x) - F{x - fin)) Gfcix), 

where, due to continuity of fc, the convergence holds uniformly. 

Example 2.5 (Change in the variance). To describe the change-in-variance-problem dehne 
Gn{x) = 1/(1 — 5n)G{x), with 5n ~ dnjn. For ease of notation let 6n = dn/n. Then we get 


sup \5n^{F{x) - Ffnfix)) - x/g(x)| 




= sup\6n^{F{x) - F{x - 6nx)) - xfcix)] 


= sup 

xGR 

< sup 


xF{x) — [x — 5nX)F{x — 5nX) 


SnX 

xF{x) — {x — 5nX)F{x — 5nX) 


6nX 


- F{x - 5nx) - xfcix) 


- {xfcix) + T(x)) 


sup|E(x - Snx) - T(x)|. 


( 2 . 8 ) 

(2.9) 


The derivative of xF{x) is xfcix) + Fix), hence ()2.8I) converges to 0. The convergence is 
uniform, if fc and F are continuous. (|2.9p converges to 0, because of continuity, monotonicity 
and boundedness of F. Thus (12.7p holds with function gix) = xfcix). Assume without loss of 
generality cr„ = l/(l — (5„)>1, then 


Pmax{G(Ai),Gn(Ai)} < x) 

= PianGiXi) <, Gix) > 0) + PiGiXi) < x, G(Xi) < 0) 


Fix/an), if X > 0, 
Fix), if X < 0. 



































The minimum can be treated analogously, hence Assumption A2 follows from convergence of 
the marginals. 

Additionally one might consider a combined change in mean and variance, given through 
Gn{x) = (JnG{x) + pin- In this case ()2.7p holds with g{x) = /g'(x)(C'i + G 2 X). 


Example 2.6 (Generalized inverse of a mixture distribution). By using the generalized inverse 
of a distribution function one could generate su bordinated Gaussian processes with any given 
marginals, see for example iDehling et al.l (j2013l b We use this for the change-point problem by 
setting 


G = F-^o^ and Gn - o ^>. 
For a continuous distribution function F* define the mixture 


F^n){x) = {1 - 6n)F{x) + 6nF*{x), 

with 6n ~ dnn~^. Then (12.7p holds with g{x) = F*{x) — F(x) and moreover 

P(max{G(Ai),G„(Ai)} < x) = P(max{F-^ o $(Xi),p-J o ^.(Ai)} < x) 

= PmXi) <mm{F{x),F^n){x)}) 

= min{P(x),F(„)(x)}. 

Analogously one has P(min{G(Ai), Gn(Ai)} < x) = max{P(x), P(„) (x)}. Hence 

P(min{G(Ai),Gn(Ai)} < x) - P(max{G(Ai), Gn(Ai)} < x) = |P(n)(a:) - F{x)\, 


thus Assumpt ion A2 is als o satished. For strongly mixing data similar local alternatives were 
considered by llnouel (1200111 . 


Example 2.7 (^^-distribution). Gonsider a y^-distribution given through G(x) = x^ and not e 
that the indicator functions have Hermite rank m = 2, see also IDehling and Taaod ()l989al l. 
Further let 


Gn(x) = 


a^x^, if X > 0, 


X 


if X < 0, 


with Hermite ranks m{n) = 1 for all n E IN. If (o„ — 1) ~ dn^ 2 /n, then one can show (similar 
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to the case of a variance change in Example 12.5p that 


{P {G{X,) < x) - P {GniXi) < x)) 
G^/x(|){^/x)ll0^oo){x), 


uniformly in x. As Assumption A2 is satisfied, too, we may apply Corollary [2] (ii) with function 
g{x) = Cy/x4'{y/x)l[Q^^){x) and m = 2. 

Example 2.8 (Multiple Hermite processes in the limit). In the previous example, together 
with the marginal distribution, also the Hermite rank has changed. However, the limiting 
process seems to be untouched by this fact and one might ask whether this is intuitive or not. 
It is caused by the fact that the change in the distribution and the change in the Hermite 
coefficients, both originating in the difference of the functions G{x) and Gn(x}, are of the same 
order. 

To get an additional Hermite process in the limit, one would need (un — 1) ~ dn, 2 /dn,i, see 
Corollary 12.131 and its proof. But then 


-—sup \F{x) - E(„)(a:)| 

Un,2 X 


n rfn,l 
dn^l dti^2 


sup|F(x) -F(„)(x)| 


—>■ oo, 


and the test would have asymptotic power 1. 

To achieve nontrivial asymptotic power one has to consider structural breaks that consists of 
two aspects and where only one is captured by the marginal distribution. To this end define 
the transformations 


G{x) = $-^(F(|x|)) = $-^(2$(|x|) - 1) 


and 


Gn{x) = ^-\Fl^^iG*^ix)) + gn, 
where F*^^{x) = P{G^{Xi) < x) and 

{ ttnx'^, if X > 0, 

x^, if x < 0, 

for some sequence {an)n with 0^/1 and — >• 1. On the one hand, {1{g()<x}}x has Hermite 
rank m = 2 and G{Xi) ~ N{0, 1). On the other hand, {l{Gn()<a:}}x has Hermite rank m{n) = 1 
for all n G M and Gn(Xi) ~ !)■ Now let fj-n ~ dn, 2 /n., then Example 12.41 applies and we 
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obtain 


77 77 

-I -(^(n)(a;) - F{x)) = -1 -(^(x - Hn) - ^(x)) C4>{x), 

«n,2 Mn,2 

for any sequence {an)n- In contrast, the convergence of the Hermite coefficients is highly 
influenced by {an)n- If the sequence is chosen such that {an — 1) ~ dn, 2 /dn,i (therefore, it 
converges slower than then the sequential empirical process will converge towards 

/•^ 2 (x)/ 2 Z 2 (t), ift<r, 

yji{x)Zi{t) + J2{x)/2Z2{t)+, \it>T. 

Actually this can be proved similar to Corollary 12.131 Moreover, the Kolmogorov-Smirnov 
statistic converges weakly to 

sup sup|A'(a;, t) — tK{x, 1) — ^jJr{t)C(j){x)\. 

iG[0,l] xGR 

We find this example rather pathological, therefore such situations are excluded from the main 
results via Assumption A2. 


2.4 Asymptotic relative efficiency 


By studying the asymptotic distributions under local alternatives one might compare different 
tests in terms of the asymptotic relative efficiency (ARE). Here we give a precise definition of 
the are in th e very special context of our change-po int setting. Th e general idea is due to 


PitmanI (119481) (for a published article see for example [NoetheiJ (jl95ril H and was formalized in 


Noethen (jl955l i. Of course it can be extended to all kinds of testing procedures. 


Definition 2.9. Let Ti and T 2 represent two change-point test procedures. Consider the local 
alternatives 


{G,Gni^,T) and a sample size {nk)k, 

{G,Gmk,x) and a sample size {mk)k, 

such that Gn^{x) = Gmu{x) = Gk{x) for all /c > 1 and x G R. 

Let /3i be the asymptotic power of the test Ti against the local alternatives given by (G, Gn ^,, t, {nk)k) 
and (32 be the asymptotic power of the test T 2 against the local alternatives given by (G, Gm^ > x, {mk)k)- 
If /3i equals (32-, then the asymptotic relative efficiency (ARE) of the tests Ti and T 2 is defined 
as 


ARE{n,T2) 


ruk 

hm -. 

k^oo 71]^ 
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Example 2.10 (Mean-shift in Gaussian data). Consider G{x) = x and Gn{x) = G{x) + Hn, 
in other words a mean-shift in Gaussian data. As for the Hermite coefficient function, we 
get Ji{x) = —(p{x), where (p is the standard normal probability density. Thus, according to 
Corollary [2l the test statistic converges towards 


sup|(/)(x 


sup 

ie[o,i] 


Bnit) + Gijjr{t) 


(27r 


1 - 1/2 


sup 

ie[o,i] 


Bnit) + G'4>r{t) 


whereas under the Null, that is we have a stationary standard Gaussian sequence, the limit 
distribution would be 


sup ())(x) sup 

Bnit) 

= (27r) sup 

Bnit) 

xeR te[o,i] 





For the Cramer-von Mises statistic we obtain analogously the limit distributions 


/ (j)^{x)dx sup 

Bnit) + Giprit) 

2 

and 

/ (j)^{x)dx, sup 

Bnit) 

/ te[o,i] 



/ te[o,i] 



under local alternative and hypothesis, respecti vely. Hence in this special case the CUSUM 
test, the Wilcoxon test fsee iDehling et al.l (j2013|) for each), the Kolmogorov-Smirnov test and 
the Cramer-von Mises test all have the same asymptotic power, namely 


P ( sup \BH{t) +G'lpr{t)\ > qi-a,H] , (2.10) 

Vte[o,i] J 

where qi-a,H is the (1—a)-quantile of the maximum of a fractional Brownian bridge sup^gjg,!] (^)l- 
As a direct consequence, one gets that the ARE of the four tests is 1. This result is quite sur¬ 
prising, keeping in mind that CUSUM and Wilcoxon tests are designed to detect level-shifts, 
while our tests have power against all kinds of distributional changes. 

For non-Gaussian data and change-points beyond a simple mean-shift, the investigation of the 
ARE is not that straightforward. In fact, little is known about the distribution of 


sup|Rh(/) + /(/)!, 

t 

and even less if higher order Hermite processes are considered. This seems to prevent a precise 
computation of the ARE in many cases. However, one might derive lower bounds for the 
efficiency as we do in next example for a combined change in mean and variance. Unlike in 
the previous example, we will make use of the subtle definition of the ARE. 

Example 2.11 (Combined change in mean and variance). Let G{x) = x and Gk{x) = a^x+^k, 
that is a combined change of mean and variance in Gaussian data. If further fikk/dk^i Gi > 
0 and {1 — l/ak)k/dk^i —>■ G 2 > 0, then by example 12.51 the empirical bridge-type process 
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converges to (for the sample size k oo) 


4>{x)BH{t) + 4>{x){Ci + C 2 x)ijjrit), X G R, t G [0,1]. 


We now consider slightly modified Cramer-von Mises and CUSUM tests, in detail, instead of 
[0,1] the supremum is taken over [ki, K 2 ] for some ki G (0,1/2) and K 2 G (1/2,1). 


The asymptotic distribution of the CUSUM test has been derived in iDehling et alJ (j2013j), 
but only in the case of a mean-shift with constant variance. However, for EG‘^{Xi) < 00 
the CUSUM statistic is a continuous functional of the sequential empirical process. Thus, we 
might apply our Theorem [T] and conclude that the CUSUM statistic converges under this type 
of local alternatives to 


sup 

te[Ki,K2] 
= sup 

t€[Kl,K2] 


^ (pix) + (Cl -b C 2 x)'ipr{t)^ dx 

Bnit) + Ciiprit) 


Note that this is the same limit as under a mean-shift with constant variance and thus, too, 

the asymptotic power is the same as in example 12.101 

The limiting distribution of the Cramer-von Mises statistic is given by 


= 


sup [ (f{x) (Bnit) + (Cl -b C 2 x)ip^ 


te[Ki,K2] 


dx 


sup I [ (p^{x)dx (Bnit) + Ci'ijjrit)) -bC|V’r(^) [ 4>^{x)x^dx 

[kiM U ^ ^ J 


te[Ki 


and for its asymptotic power we obtain 


P[Z^ > j 


dx 


= P[Z^> ql_. 


H / (j)^{x) dx , sup {Bnit)} > qi-a,H 

J te[Ki,K2] , 


( 2 . 11 ) 


H 


+ P [z‘^ > q‘f_a^H / dx , sup {Bnit)} < qi-a, 

\ J te[Ki,K2] / 

First assume siip^{BH(t)} < q = qi-a,H and consider C*, given by 
C* = f*{Cl,C2,q,T,Kl,K2) 

\Jq^ + ‘^qCl1pr{t) + (c^ -b C|(/ 'lp^{x)x‘^dx/ f (/3(x)dx)) Ipri't) — Q 


= mm 
te[Ki,n2] 


^pT{t) 
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Now Cl is constructed in a way, such that 


0 


Cl > Cl 


and for all a; G ri with sup^ < q 


= 


sup I [ (f)^{x)dx (Bnit) + Cltprit)) +C' 2 V'r(^) [ 4'^{x)x‘^dx 

\ki,K 2] [J ^ ^ j 


te[Ki,K2] 

> sup I f (p^{x)dx (Bnit) + Cl'^l>^ 
t£[Kl,K.2] I-/ 


If, on the other hand, supj{i?//(t)} > qi-a,H, then (because Ci > 0) automatically > 
Qi-a HI 4>^{x)dx. Combining these two findings with (|2.11|) we can bound the asymptotic 
power from below by 


P[Z^> ql_^„ 


= P 


J c 

sup [ (j)^{x) (Bnit) + Cl-iprit)) dx > ql_^ H [ (f{x)da 
,te[Ki,K2] J ^ ^ d 


> P I sup 

WG[ki,K2] 


Bnit) + Cliprit) 


> qi-a,H , 


( 2 . 12 ) 


for Cl > Cl. 

Now we are ready to compute the ARE. To this end we chose different sample sizes for both test. 
In detail, {nk)k for the CramCT-von Mises test and {mk)k for the CUSUM test. Moreover, the 
local alternatives are such that C^^^(x) = C^^^^{x) = Ck{x) for all x G R, consequently 

= IJ'ml = IJ'k and 


For the CUSUM test, in order to achieve at least asymptotic power /3, its limit distribution 
has to satisfy 


P sup \BH{t) + Cllpr{t)\ > qa,H > /3. 
Vte[o,i] / 


In other words. Cl = ir ^{(3), where 


7r(C*) = P { sup iBnit) + Cl'iprit)\ > qa,H 
Vt6[o,i] , 


^Here weed the restriction to [ki,K2]. 
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and TT ^ is the generalized inverse. Therefore, the sample size of the CUSUM test has to be 
chosen such that 


C: = lim 


ruk 


fc—>-00 dm I, fc—>oo d- 


= lim 


rrik 


~ ■ 


^nik 


’TUk 


We also obtain (as /i^ and (1 — l/cTk) are of the same order) 


(2.13) 


lim 


ruk 


fc—>-00 d' 


nik 



lim 


ruk 


k^OD d' 


nik 



a 


2 ) 


for some > 0. 

Next we will select the sample size for the Cramer-von Mises test such that its asymptotic 
power might be bounded from below as in (I2.12p (and therefore by /3). To this end choose 
Cl > 0, such that 


fiCi) 


c* 

f{Ci,Ci-^,q,T, ki,K2) 
^1 


Cl 


The function /: [0, oo) —)■ [0, oo) is monotone increasing, surjective and continuous (as the 
minimum is attained either in ki or K 2 ), therefore such an Ci always can be found. By 
construction of the function / it follows that Ci < Cl Now let the sample size of the CramCT- 
von Mises test satisfy 


Cl = 


1 . '^k ( 1 ) -I. 

iim ——/i^/ = iim —— 


k^oo d' 


•rik 


rik 


/ c —>-00 d ' 


l^k • 


rik 


(2.14) 


Moreover, we observe 


C 2 = lim nk/dnt,{X - 1/ufc) 

fc^OO 

= lim {nk/dn^fik) imk/dmkfJ-k)~^ (rnk/dm^il - l/(Tk)) 
k—yoo 

= CiCl/Cl 


Thus, 


f{Cl,C2,q,T,Kl,K2) = Cl 
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and we observe for the asymptotic power of the Cramer-von Mises test 


P 


sup I f 4>^{x) dx (Bnit) + Ci'ijjrit)] + f 4>^{x)x‘^ dx \ > qi-a,H j (f)^{x)dx 

.te[Ki,K2] ^ J ) J I 


> P I sup 

\te[Ki,K2] 

>13. 


Bnit) + ClMt) 


> qi-a,H 


So both test have (at least) asymptotic power /? against the local alternatives (G, Gfc,r). 
Finally, 


/ rnk 

V 'nk 

by construction of the sample sizes and the definition of slowly varying functions. Consequently 

ARE{CvM,CUSUM) = > 1. 

In other words, the Cramer-von Mises test is asymptotically more efficient, no matter how 
small the additional variance-change is. 


ruk dn^ 

L(V2)(nfc) 


2.5 The empirical process of triangular arrays 


Dehling and Taaaul (jl989al lbfl. uniform reduction principles have become the 


Since the work of 

main tool in the analysis of empirical processes of long-range dependent data. More precisely, 
the empirical process gets approximated only by the first term of its Hermite expansion (if the 
underlying process is not Gaussian other expansions are available). However, most results are 
investigated for stationary sequences. When considering G{Xi ),..., Gn(^[nTj+i)) ■ ■ ■) GniXy, 

the empirical process of t he first [nrj random variables can be approximated just as in 
Dehling and Taaod (jl989al b In contrast the Hermite expansion of ^{Gn{Xi)<x} ~ T'(„)(x) is 


OO j 


(x) 


Hq{Xi 


q=m^ 


Two difficulties arise. First, m* might be smaller than m, the Hermite rank of {l{G(-)<a;}}xeR- 
Secondly, the coefficients Jq^n{x) depend on n and might converge uniformly to 0. Thus, it is 
a priori not clear which term of the Hermite expansion is asymptotically dominant or if there 
are even more than one. The next result is a reduction principle that lays emphasis on this 
aspects. We will make use of it in the proof of Theorem [H but is also of interest on its own. 
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Theorem 3. Let {Gn\n be a sequence of measurable functions and let m{n) be the sequence of 
Hermite ranks of {l{Gn(-)< 3 ;}}a^eR- Then, for any m E IN with m{n) < m < 1/D (for n> uq), 


P 


sup sup 


t€(0,l) a:GR C^n,m 


lTLt\ rn 

i=l q=0 






> e I < Cn-^(l + e-3), 


where C and k do not depend on n. 


Remark 2.12. (i) Theorem [3] contains the reduction principle of iDehling and Taaaul (jl989al i 
as a special case {Gn{x) = G{x) and m(n) = m). 

(ii) Note that {l{G„(-)<x}“-^(n)(^)}a:GR might has a Hermite rank smaller than m (say m* < m). 

Thus, one might expect d~\^» as normalization. The weaker normalization is however 
possible since the empirical process is approximated by additional terms of the Hermite ex¬ 
pansion, in detail those up to m. _ 

(iii) A similar result is given bv IWul (|2003l ). who considers linear long memory processes and 
even shows convergences with respect to a weighted supremum metric. Then again, he con¬ 
siders only the normal empirical pro cess, while we also treat the sequential version. Moreover, 
we consider triangular arrays, which IWul (j200j) does not. 

Corollary 2.13. Let {Gn}n be sequence of measurable functions and let m{n) be the sequenee 
of Hermite ranks o /If further m* < m{n) <m < 1/D for all n > uq and 


dn,q Jq,n{x) 


d. 


n^m 


q\ 


hq{x) Mq & {m*,... ,m}, 


uniformly in x, then 


\nt\ 


■^0-{Gr.(Xi)<x}- F(n){x)) ^ hq{x)Zq{t) 


i=l 


q=m'* 


(Zq, jy(t))ig[oq] are uneorrelated, qth order Hermite proeesses. 

Remark 2.14. (i) Comparing the limit process of Corollary 12.131 to that of Theorem [1] it is 
apparent that multiple Hermite Processes are involved. This is not the case in Theorem [TJ The 
reason is Assumption A2, which causes the Hermite coefficients Jm,n{x) to converg e rather fast. 

(ii) Th e Hermite processes occurring in the limit are dependent, see Proposition 1 in lBai and Taaau 
i2013l ). 

Remark 2.15. In view of the proof of Corollary 12.13l it is important to note that the functions 
hq are uniform limits of the cadlag-f unctions Jm.,n (x) and hence elements of oo,oo]. As a 
consequence they are also bounded (jPollardI (| 19841 )). 
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Example 2.16. There are indeed sequences of functions {Gn}n that satisfy the conditions of 
Corollary 12.131 Consider again the functions from Example 12.71 namely Gn{x) = x'^{lx>o + 
o>n^x<o) with On —>■ 1 and o^ / 1. Thus, we are in the situation of Theorem [3] with m(n) = 1 
for all n G IN. One obtains, o^ ^ Is 

sup| J2,n(a;) - J2ix)\ -)■ 0 , 

with 


Mx) = - 1)] = -‘^Vx(t>{Vx)l{x>o}- 

If in addition an ~ ~ dn, 2 /dn,i, then 


sup 

X 


^^Jl,n{x) - Cx(j){^/x)lx>0 
an,2 


—)• 0 , 


for some constant C depending on D only. Corollary 12.131 then holds with m = 2, m* = 1, 
hi{x) = Cx(j){y/x)lx>Q and /i 2 (x) = J 2 (x)/ 2 . 


3 Simulation Study 

3.1 Fractional Gaussian Noise 

Consider a mean-shift in Gaussian data. Then Example 12.41 states that the Cramer-yon Mises 
test (and the Kolmogoroy-Smirnoy test) are asymptotically as efficient as the CUSUM test. 
The goal of this simulation study is to examine whether this theoretical and asymptotic result 
carry oyer to the hnite sample performance of the tests. We will consider samples of size 50 
to 400. For these situations the approximation of the empirical process by its semi-degenerate 
limit process is quite inaccurate. The empirical size of the Cramer-yon Mises test will be 
therefore much larger than the nominal size, if critical yalues are deduced from the asymptotic 
distribution. Instead we simulate J = 1000 Gaussian time series 

. . . Xj^n j — 1, • • • T 

with Hurst coefficient H. In the simulation study we will use fractional Gaussian noise for this 
sequences. Subsequently, a Cramer-yon Mises statistic is calculated for each of the J = 1000 
Gaussian series, in detail 

k k ^ A ^ 

^{Xj^i<x} ~ ~ ^{Xj,i<x} j dFn,j{x) j = 1, . . . J. 

i=l i=l J 


Sn i = max 

’ l<k<n 
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Table 1: Empirical power, H assumed to be known, size of level shift /U = 1, relative change 
positions r = 0.2 and r = 0.5. 





Relative change position 

r = 0.2 





H -- 

= 0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

S 

0.196 

0.566 

0.854 

0.970 

0.158 

0.215 

0.547 

0.689 

W 

0.263 

0.525 

0.910 

0.983 

0.201 

0.233 

0.501 

0.636 

C 

0.288 

0.666 

0.933 

0.986 

0.276 

0.284 

0.555 

0.769 



H -- 

= 0.8 



H = 

0.9 


n 

50 

100 

250 

400 

50 

100 

250 

400 

S 

0.101 

0.221 

0.241 

0.350 

0.057 

0.098 

0.236 

0.167 

W 

0.089 

0.156 

0.264 

0.383 

0.089 

0.116 

0.191 

0.147 

c 

0.171 

0.234 

0.348 

0.349 

0.164 

0.127 

0.239 

0.223 




Relative change position 

r = 0.5 





H -- 

= 0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

S 

0.664 

0.919 

1.000 

1.000 

0.524 

0.682 

0.925 

0.970 

W 

0.621 

0.930 

0.998 

1.000 

0.513 

0.742 

0.906 

0.967 

c 

0.733 

0.918 

0.997 

0.999 

0.599 

0.717 

0.919 

0.960 



H -- 

= 0.8 



H = 

0.9 


n 

50 

100 

250 

400 

50 

100 

250 

400 

S 

0.418 

0.504 

0.655 

0.830 

0.359 

0.461 

0.475 

0.526 

W 

0.374 

0.485 

0.674 

0.770 

0.387 

0.430 

0.578 

0.587 

c 

0.400 

0.553 

0.673 

0.766 

0.393 

0.499 

0.522 

0.553 


We then use the empirical quantiles of {Sn^j^j^i as critical values. The Cramer-von Mises 
statistic is invariant under monotone transformations of the data (as is the Kolmogorov- 
Smirnov statistic). Hence the critical values are valid if our observations are monotone trans¬ 
formations of Gaussian data. We note that this is a strong assumption and that an accu¬ 
rate approximation of the empirical process for general long-range dependent data is an issue 
of future research. The CUSUM statistic is not invariant under monotone transformations. 
Therefore, the Wilcoxon change-point test is considered additionally. 

In the first part of the simulation study we treat realizations of a Gaussian process Xi ,..., 
given by fractional Gaussian noise). For the implementation we have used the function fgnSim 
from the R-package f Arma. Eventually a change is added hy Yi = Xi + //!{*> [nrjj the three 
mentioned change-point tests are applied to Ti,..., W- 
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Table 2: Empirical size, estimated Hurst coefficient. 




H = 

0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.067 

0.088 

0.065 

0.058 

0.080 

0.076 

0.063 

0.043 


0.082 

0.105 

0.085 

0.083 

0.122 

0.143 

0.107 

0.081 

Wh 

0.067 

0.058 

0.054 

0.057 

0.081 

0.074 

0.061 

0.035 

Wh, 

0.070 

0.100 

0.102 

0.078 

0.122 

0.118 

0.100 

0.081 

Ch 

0.071 

0.072 

0.064 

0.046 

0.111 

0.080 

0.056 

0.056 


0.085 

0.090 

0.103 

0.081 

0.116 

0.127 

0.095 

0.078 



H = 

0.8 



H = 

0.9 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sii 

0.094 

0.104 

0.062 

0.063 

0.075 

0.070 

0.092 

0.080 

% 

0.136 

0.141 

0.099 

0.077 

0.087 

0.127 

0.118 

0.112 


0.085 

0.074 

0.056 

0.059 

0.073 

0.071 

0.097 

0.079 


0.129 

0.137 

0.087 

0.074 

0.098 

0.146 

0.115 

0.103 

Ch 

0.165 

0.101 

0.051 

0.046 

0.309 

0.257 

0.112 

0.075 


0.218 

0.127 

0.094 

0.072 

0.137 

0.130 

0.083 

0.063 


If the Hurst-coefficient is assumed to be known, the empirical size of the tests naturally equals 
the nominal one, due to the construction of the critical values. The empirical power of Cramer- 
von Mises (denoted by Sn), Wilcoxon (denoted by Wn) and CUSUM test (denoted by Cn) is 
displayed in Table [TJ If the change occurs in the middle of the observation period, the three 
tests are showing almost exactly the same performance, which matches the theoretical results. 
For early changes (after 20% of the observations) the CUSUM test is slightly more accurate 
than the other tests. Depending on sample size and strength of dependence, either the Cramer- 
von Mises or the Wilcoxon test might be second best. 


3.2 Unknown Hnrst coefficient 

In applications the true Hurst coefficient H is unknown, and in the following we will consider 
two different estimators. The fir st is the l ocal W hittle estimator (denoted by H) with band¬ 


width parameter m = see iKiinschl ( 19871 1. However, if there is actually a change in the 


data, the local Whittle estimator is known to be biased. For the second estimator we therefore 
divide the observations into two subsamples 


Xi,...,W and X, 


k+V 
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Table 3: Empirical Power, estimated Hurst coefficient, size of level shift /U = 1, relative change 
positions r = 0.2 and r = 0.5. 





Relative 

change 

position 

r = 0.2 





H -- 

= 0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.178 

0.263 

0.597 

0.828 

0.189 

0.177 

0.285 

0.404 

% 

0.248 

0.491 

0.834 

0.946 

0.260 

0.377 

0.528 

0.620 

Wfy 

0.190 

0.298 

0.625 

0.811 

0.146 

0.193 

0.278 

0.386 

Wh, 

0.291 

0.532 

0.850 

0.943 

0.237 

0.364 

0.488 

0.649 

Ch 

0.289 

0.413 

0.706 

0.896 

0.312 

0.266 

0.379 

0.547 


0.378 

0.596 

0.874 

0.972 

0.312 

0.422 

0.585 

0.721 



H -- 

= 0.8 



H = 

0.9 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.115 

0.125 

0.156 

0.185 

0.096 

0.105 

0.142 

0.163 

% 

0.208 

0.265 

0.287 

0.296 

0.160 

0.213 

0.208 

0.234 


0.150 

0.146 

0.161 

0.181 

0.094 

0.126 

0.143 

0.158 

Wh, 

0.214 

0.271 

0.281 

0.322 

0.137 

0.188 

0.237 

0.234 

c* 

0.405 

0.321 

0.270 

0.305 

0.539 

0.442 

0.367 

0.328 


0.313 

0.321 

0.383 

0.429 

0.418 

0.350 

0.311 

0.313 




Relative 

change 

position 

r = 0.5 





H -- 

= 0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.541 

0.759 

0.985 

0.999 

0.412 

0.557 

0.814 

0.904 

% 

0.614 

0.860 

0.990 

0.999 

0.539 

0.710 

0.881 

0.952 

Wfy 

0.594 

0.811 

0.984 

0.999 

0.441 

0.564 

0.809 

0.902 

Wh, 

0.609 

0.877 

0.991 

1.000 

0.550 

0.717 

0.878 

0.950 

Ch 

0.677 

0.819 

0.988 

1.000 

0.584 

0.671 

0.850 

0.925 


0.694 

0.905 

0.995 

0.998 

0.567 

0.760 

0.920 

0.953 



H -- 

= 0.8 



H = 

0.9 


Sh 

0.373 

0.403 

0.535 

0.640 

0.337 

0.428 

0.472 

0.549 

% 

0.454 

0.563 

0.649 

0.711 

0.412 

0.480 

0.582 

0.604 


0.369 

0.413 

0.563 

0.648 

0.357 

0.387 

0.506 

0.536 

Wh, 

0.443 

0.566 

0.662 

0.710 

0.433 

0.528 

0.544 

0.599 

Ch 

0.576 

0.584 

0.655 

0.706 

0.659 

0.622 

0.631 

0.637 


0.535 

0.599 

0.716 

0.760 

0.604 

0.574 

0.637 

0.638 
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Table 4: Empirical Power, estimated Hurst coefficient, relative change position r = 0.5, level 
shift of size /U = 1 and change in variance from cj^ = 1 to Ug = 5/4, nominal size a = 0.05. 




H = 

0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.701 

0.931 

1.000 

1.000 

0.606 

0.772 

0.963 

0.996 


0.878 

0.986 

1.000 

1.000 

0.817 

0.973 

0.999 

1.000 

Wfy 

0.609 

0.812 

0.989 

1.000 

0.572 

0.593 

0.879 

0.952 


0.734 

0.973 

1.000 

1.000 

0.705 

0.907 

0.988 

0.998 

Ch 

0.660 

0.899 

0.999 

1.000 

0.529 

0.724 

0.938 

0.983 


0.588 

0.916 

1.000 

1.000 

0.507 

0.806 

0.983 

0.998 



H = 

0.8 



H = 

0.9 


Sh 

0.466 

0.636 

0.824 

0.898 

0.507 

0.599 

0.755 

0.797 


0.762 

0.942 

0.983 

1.000 

0.824 

0.960 

0.993 

0.993 

Wh 

0.597 

0.568 

0.669 

0.727 

0.718 

0.645 

0.582 

0.634 


0.616 

0.850 

0.944 

0.981 

0.562 

0.823 

0.926 

0.953 

Ch 

0.445 

0.601 

0.806 

0.853 

0.438 

0.551 

0.731 

0.762 


0.443 

0.635 

0.852 

0.937 

0.460 

0.576 

0.781 

0.847 


and estimate H on each set, using again the local Whittle estimator. Finally the new estimator 
is given by Hf, = k/nHi + {n — k)/nH 2 - Here k is the natural change-point estimator, associated 
to each test. For example, in case of the Cramer-von Mises test we use 


k = min <1 < k < n — 1 I Ukn = max Uk i 

l<fc<n-l 


where 


Uk,n — 



^{Xi<x} ~ ~ ^{Xi<x} I dFnix). 


2=1 


Harizetal 


(2009|). iHorvath and Kokoszkal ( 19971 1 


Consistency of this estimator was shown in 
verified consistency for the analogous CUSUM-based estimator. 

Empirical size and empirical power of the tests under unknown H are displayed in tables [2] and 
[3l Let us first compare the impact of the different estimators H and Hj^ on the finite sample 
performance of the Cramer-von Mises test. If we use the classical local Whittle estimator, the 
empirical size of the test is quite accurate and even matches the nominal size for n = 400 and 
H < 0.8. However, there is a loss in the empirical power. The power performance is much 
better, if the local Whittle estimator is modified. Actually there is no loss in power if compared 
to the case where H was assumed to be known. Then again, the probability of a false rejection 
is higher than a = 0.05, so the test is quite liberal. 
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Table 5: Empirical Power, estimated Hurst coefficient, relative change position r = 0.5,Gi(x) = 
G 2 {x) = + x/2 + 1/2, nominal size a = 0.05, H is the Hurst coefficient of the underlying 

Gaussian. 




H = 

0.6 



H = 

0.7 


n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.535 

0.827 

0.983 

0.999 

0.487 

0.758 

0.957 

0.988 


0.494 

0.815 

0.992 

0.999 

0.479 

0.750 

0.968 

0.995 


0.480 

0.743 

0.990 

0.998 

0.430 

0.632 

0.924 

0.986 


0.420 

0.735 

0.986 

1.000 

0.407 

0.657 

0.933 

0.987 

Ch 

0.424 

0.616 

0.853 

0.958 

0.399 

0.547 

0.745 

0.858 


0.387 

0.569 

0.828 

0.920 

0.390 

0.546 

0.755 

0.891 



H = 

0.8 



H = 

0.9 


Sh 

0.461 

0.670 

0.825 

0.885 

0.424 

0.522 

0.607 

0.614 

% 

0.474 

0.680 

0.860 

0.934 

0.507 

0.589 

0.698 

0.704 


0.376 

0.537 

0.670 

0.773 

0.350 

0.369 

0.443 

0.438 

Wh, 

0.418 

0.555 

0.738 

0.828 

0.458 

0.496 

0.488 

0.511 

Ch 

0.374 

0.464 

0.564 

0.596 

0.384 

0.352 

0.343 

0.352 


0.440 

0.538 

0.655 

0.772 

0.397 

0.491 

0.562 

0.579 


Next we compare CramCT-von Mises, Wilcoxon and CUSUM test. The empirical size of the 
three tests is similar, no matter which estimator we choose and which situation we assume 
(sample size, Hurst coefficient), see Table [2j 

In terms of empirical power the Cramer-von Mises and Wilcoxon test give similar results with 
the CUSUM test being slightly ahead for r = 1/2 and being clearly advantageous for early 
changes r = 1/5 (see Table ED . 

We have to keep in mind that CUSUM and Wilcoxon test are designed to detect changes in 
the mean. On the contrary, the Cramer-von Mises test is a so called omnibus test and has 
power against arbitrary changes in the marginal distribution. 

Therefore, we consider another situation, with the mean-shift being now accompanied by a 
small change in the variance. In detail. 


y _ \^i for * < [ri'rJ , 

I aXi + n for i > [nrj, 

for Gaussian {Xj}j>i. The theoretic result from Example 12.111 indicates that in this scenario 
the Cramer-von Mises test should be advantageous. In fact, for all combinations of sample size 
n and Hurst coefficient H the empirical power against this change is always higher than the 
power against a mean-shift under constant variance. Moreover, the Cramer-von Mises test has 
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Table 6: Empirical size and power for farima{0, 0.2,0)-sequences, Hurst coefficient is esti¬ 
mated, nominal size a = 0.05. 




No change 



Mean-shift p = 

1 

n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.036 

0.058 

0.069 

0.056 

0.276 

0.490 

0.832 

0.920 


0.167 

0.148 

0.119 

0.129 

0.520 

0.737 

0.939 

0.986 

Wfy 

0.067 

0.139 

0.086 

0.074 

0.601 

0.882 

0.968 

0.730 


0.247 

0.189 

0.160 

0.153 

0.573 

0.711 

0.934 

0.980 

Ch 

0.104 

0.061 

0.053 

0.048 

0.281 

0.479 

0.836 

0.945 


0.212 

0.202 

0.158 

0.114 

0.499 

0.682 

0.937 

0.977 


clearly higher power then CUSUM and Wilcoxon test, which matches the theoretical findings 
of Example 12.111 

Moreover, we consider the change-point problem (based on non-monotone transformations) 


h* = 


xf 

Xf aXi -t- /i 


for i < [nrj, 
for i > [nrJ, 


corresponding to a situation in which mean, variance, skewness and the Hermite rank change 
(see Example 12.8p . Table [5] displays the empirical power of the three tests against this al¬ 
ternative and the picture is quite clear. The Cramer-von Mises test has the highest power 
for all combinations of H and n, while the Wilcoxon test is second best. Also note that the 
Hermite rank of the pre-change random variables is m = 2. Consequently, these observations 
are short-range dependent for H < 0.75. 


3.3 farima{0,d,0)-processes 

Eor Gaussian long memory processes beyond fractional Gaussian noise, not only the Hurst 
coefficient determines the normalization. Instead it is given by 

dn = n^L^/\n){H{2H 

see m- In this study we assume, as n ^ oo, L{n) C, which is quite common in the 
literature. Eor fractional Gaussian noise, C = H{2H — 1) so the two factors just cancel out. 
In general the constant C is given through the limit 

p{k)k‘^-^^ C, 
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Table 7: Empirical size and power for farima{l, 0.2,0)-sequences with AR-coefficient ai = 0.4, 
Hurst coefficient is estimated, nominal size a = 0.05. 




No change 



Mean-shift p = 

1 

n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.032 

0.021 

0.030 

0.034 

0.154 

0.183 

0.329 

0.433 


0.126 

0.065 

0.062 

0.052 

0.316 

0.325 

0.415 

0.489 


0.038 

0.009 

0.003 

0.007 

0.157 

0.158 

0.192 

0.266 

^H, 

0.183 

0.048 

0.015 

0.017 

0.363 

0.313 

0.301 

0.387 

Ch 

0.445 

0.288 

0.124 

0.081 

0.625 

0.615 

0.605 

0.689 


0.422 

0.302 

0.138 

0.093 

0.592 

0.613 

0.656 

0.710 


as A: —>■ oo. We suggest an estimator for C (which is quite heuristic) by: 

C = , (3.1) 

k=l 

with H being one of the two estimators from above. Finally, we use the normalization 

= n^C^/^{H{2H - l))^/2_ 

The estimator C in (13.ip is only dehned under long memory, that is 77 > 0.5 (or in this situation 
H > 0.5). Therefore, we modify both estimators by considering max(77,0.501) instead of H. 
The effect of this modification on short memory processes will be seen in the next section. 
However, for farima{0, d, 0)-sequences it seems to work quite well, see Table [H Note that 
critical values are still deduced from fractional Gaussian noise. The hnite sample performance 
(under the hypothesis as well as under a mean-shift) is very similar to the case where the data 
comes from fractional Gaussian noise. Meaning, the Gramer-von Mises test has good properties 
and the different tests yield very similar results, again matching the theoretic findings. 

3.4 Short-range dependent effects 

Finally, we have considered deviations from purely LRD sequences by simulating farima{l, d, 0)- 
time series and short memory AR(l)-processes. 

First, we have applied the tests to farima{0,d, l)-sequences, which are still long-range depen¬ 
dent. Table [7] indicates that the empirical power of the Gramer-von Mises test is less than in 
the case of farima{0, d, 0)-processes. However, the test works principally well, meaning that 
the power increases with the number of observations while the empirical size stays close to the 
nominal size. For GUSUM and Wilcoxon test this seems to be not the case. 
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Table 8: Empirical size and power for ^i2(l)-sequences with AR-coefficient ai = 0.6, Hurst 
coefficient is estimated, nominal size a = 0.05. 




No change 



Mean-shift ^ = 

1 

n 

50 

100 

250 

400 

50 

100 

250 

400 

Sh 

0.016 

0.008 

0.004 

0.002 

0.135 

0.150 

0.324 

0.576 


0.097 

0.030 

0.012 

0.004 

0.282 

0.260 

0.374 

0.556 

Wfy 

0.036 

0.004 

0.000 

0.000 

0.149 

0.106 

0.106 

0.213 

^H, 

0.141 

0.018 

0.000 

0.000 

0.336 

0.249 

0.235 

0.343 

Ch 

0.467 

0.279 

0.017 

0.006 

0.641 

0.592 

0.606 

0.721 


0.412 

0.219 

0.026 

0.006 

0.654 

0.631 

0.678 

0.794 


For the (purely short-range dependent) AR(l)-processes we make two observations: First, 
due to the assumption of LRD (ff > 0.5) the normalization is too strong and the statistics 
converge to 0, at least under stationarity. If the structural change is big enough, the tests 
might still detect the change (see Table [8|). However, there is a certain loss in power. 
Secondly, Cramer-von Mises test and CUSUM test are showing a quite different finite sample 
performance. While under LRD (in concordance with the theory) their empirical size and 
power is always very similar, we now observe situations where the Cramer-von Mises test has 
empirical power 0.374 and the CUSUM test 0.678, see the results in Table [HI Again, this 
matches the theoretical fact that under short memory the tests show a different asymptotic 
behavior. 

4 Proofs of the main results 

4.1 Proof of Theorem [3] and Corollary 12.131 

It is the goal to approximate the sequential empirical process by a linear combination of 
multiple partial sum processes. The indicator function 1{g„(Xj)<x} the Hermite expansion 

q=o 

Remind that Jg,n(x) = and especially Jo,n(x) = F(Gn(Xj) < x) = 

Now let be the Hermite expansion up to m, in detail 

m j- z . 

Lm,nA^) = Y. (4.1) 

q=0 
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Let m{n) be the Hermite rank of {^{G„{Xj)<x})x- Then we have by the conditions of Theorem 
[3] that m* < m{n) < m for some m* <m< 1/D. Thus 

m 

Lm,n,j{x) = ^ ^ Jq^n{x) / QlHq[Xj). 


q=m^ 


Moreover, define 


1 ^ 

Sn{l',x) = - \ ^{Gn(Xj)<x} ~ Lm,n,j{x)j ■ 

^r?. .m. ^ ' 


^n.m ■ 1 

’ j=i 


Finally, let 5n(/c;x,y) = Sn{k-,y)-Snik; x), Lm,n,j{x,y) = Lm,n,j{y)-Lm,n,jix) and Jn,q(x, ?/) = 

Jn,qiy) Jn,q{x). 


We will make use of the chaining technique of iDehling and Taaod ()l989al L To this end, dehne 


■^ni^x') .— 


^ r-''? 


Hq{s) 


'{Gn(s)<x} \ Q ?! 


^(s) ds 


and observe that Jq^n{x,y)/q\ is bounded by A„(x,y) = An{y) — A„(a:), for all n G IN and all 
q = 0,... ,m. Furthermore, A„ is monotone, A„(—oo) = 0 and 


An{+00)=[ (± 

V,=o 


Hq{s)\ 


q\ 


(j){s) ds = C < oo, for all n G IN. 


(4.2) 


Define partitions, similarly to lDehling and Taaod (jl989a|), but now depending on n, by 

Xi{k) = x["'\k) = inf{x| A„(x) > A,^(+oo)^2“*^} i = 0, • • • , 2*^ - 1 

for A: = 0, • • • ,K, with the integer K chosen below. Then we have 

An{Xi{k)-) - An{Xi-i{k)) < An{+oo)2~^. 

Note that the right hand side of (j4.2n does not depend on n. 

Based on these partitions we can define chaining points ffc(x) by 

Xi^(x){k) <x< Xi^(^)+i(k), 

for each x and each /c G {0,1,..., AT}, see Dehling and Taa^ (1989a). 

Lemma 4.1. Define the chaining points as above. Suppose the following two conditions hold: 
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(i) There are constants 7 > 0 and C > 0, not depending on n, such that for all k < n 

E\Sn{k;x,y)\'^ < C y). 

(a) For all e > 0 and a// n E IN there is a real number K = K{n,e), such that for all \ > 0 

2-m*D 


P I sup 


d. 




n,m ■ -I 
’ J =1 




X-m*D 


n 


Then there is a constant p > 0, such that for a// n E IN and all e > 0 the following holds: 

P (^sup\Sn{l-, x)| > < C n-^e-2(iC(n, e) + 3)^ + C 

Proof. Due to definition of the chaining points each point x is linked to —00 in detail 


We have 

K 

Sn{l;x)= ^S’n(f;Xjj^_^(2,)(A: - l),Xij^(3.)(fc)) + 5 n(Z;Xi^(a.)(W),x). 
k=l 

The last summand of the right hand side of (|4.3p can be treated as follows 

1 


(4.3) 


I Sn(fj ('^)) I 


d: 


< 


'n,m 

1 


i^{:>:ij^(x){K)<Gn{Xj)<x} I^m,njiXij^(x){K), x) j 
.7 = 1 ^ ^ 




'n,m 


i=l ^ 


+ 2 


ri,m 


y ^ ^rn,n,j ^ix(a;)+l ('^) 

i=i 

l^nil^ Xij^^x)i^) j Xj^j^(^x)+li^) )| 

I 

y ^ ^m,n,j {Xi^(^x) 3^ix(a:)+l 

i=i 


+ 2 


1 


ri,m 


(4.4) 
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By (|4.3p and (j4.4p we get, using + 2) ^ < 1/2, 


P ( sup|5n(^;x)| > e 


< P ^sup|5„(Z;x)| > e ^(/c + 2) ^ + e/2^ 

K 

- - '^),Xi^(cc){k))\ >e/{k + 2)' 


k=l 


+ P [max\Sn{l;Xij^(^^){K),Xij^(^^)+i{K)-)\ > e/{K + 3)^ 


+ p 2 d 


-1 

'n,m 


^ ^ ^rn,n,j {^)i ^iK{x)+l 

j<l 


> {e/2) 


(4.5) 

(4.6) 

(4.7) 


Further, by condition (i) of Lemma l4.II and the Markov inequality we get 
p ^ifc+i(p(^ + l))l > e/{k + 2)2 

2fc+l_i 

< ^ P (5„(Z;xj(A; + l),Xi+i(A: + 1)) > e/(/c + 2)2) 


< 


i=0 

2k+l_l 

c E 


I \ ^—^{k P 2 )^ 


< C ( 

n / 


n 


■F(^n){xi{k + l),Xi+i(A: + 1)) 


(4.8) 


The constant C in (14.8p is the constant of condition (i) in Lemma O and thus independent 
of n. In the next line this C gets multiplied with A„(+oo), which is a constant by itself. Thus 
the C in the inequality above is a universal constant, not depending on n. The same is true 
for 7 . 

Using the same arguments we get moreover 


P (^max|S„(;;Xij^(,,)(P:),Xi^(,,)+i(P:)-)| >e/(iL + 3)2) <C 
Finally we have by condition (ii) of Lemma l4.ll 


l\ _JK + 2,Y 


n 


n 


P 2d 


-1 


^ Pm,n,j{Xif^(x){K)-i Xi^(^x)+l{P-) ) 

j<l 


>{e/2)\<C{- 


2—m*D 


n 


\-m*D 
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for all A > 0. Combining the estimates for (14.5p . (|4.6I) and (14.7p we arrive at 


P ^supIS'nP/; x)| > < C n '^e + 2)^ + C 


2-m*D 


^X—m*D 


n 


< C ( - ) n-^e-^{K + 3)^ + C ( - 


n 


2—m*D 


n 


n 


\-m*D 


which finishes the proof. □ 

Lemma 4.2. There exist constants 7 and C, not depending on n, such that for all k < n 


E\Sn{k]x,y)\'^ <C 


The proof is very close to the proof of Lemma 3.1 in lDehling and Taaad (jl989al h However, for 
further results it is crucial that C and 7 only depend indirectly on the function Gn, namely 
through the Hermite rank. Thus we give a detailed proof to highlight this fact. 

Proof. First, obtain the Hermite expansion 

1(«gv(.v,)<,)-F(„)(2^.9)= E E}^h,{X.). 

q=m* 

Secondly, we have by orthogonality of the Hq{Xi) and EHq{Xi) = q\ 

Jq,n{x, y) 


g Jlnix,y) 


q=m^ 


q\ 




q=m^ 


-Hq{X,^ 


r I E 


q=m'^ 


^ {^{x<Gn{Xi)<y} Ei^j^^(x,y)^ 

= F(^n){x,y){l - F(„)(x,y)) 

< F(^n){x,y)- 


This yields 


00 j'2 1 

E{dn,mSn{k-,X,y)f = ^ EHq{Xi)Hq{Xj) 


q\ q\ ^ 

q=m+l ^ ^ i,j<k 

< E(^n)ix,y) ^\r{i- j)r""^. 
id<k 


Note that the second factor of the product in the last line may depend indirectly on the function 
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Gn, because Gn determines m, however this is the only influence. For different combinations 
of m and D the term — 7 )!™'+^ might have a differen t asymptotic order. However, 


in all cases we get (see page 1777 in iDehling and Taaaul (jl989alH 
1 


d, 




n.m . 

l,J<K 


N lV(2-(m+l)D) 


The result then follows because L and Li are slowly varying. 


□ 


Lemma 4.3. Let n G IN and e > 0. Define the chaining points and Lm.,n,j{x) as in {4G). Set 


K = K{n,e) = 


log2 


(m - m* + 2)A„(+oo) . 

-:- ndn,m 


+ 1 . 


Then there is a constant G > 0, such that for all X > 0 
! ^ ^ (-^)) ^iK(x)+l i.dL) ) 


P I sup 
x€lR 


n.m ■ T 

’ 7=1 


> e < C 


2—m*D 


n 


X-m*D 


Proof. By construction of the chaining points we have for q = 0,... ,m and for all x G IR 


snp\Jq,n{xij^(x){K),Xij^(^^)+i{K)-)/ql\ < A„(+oo)2 . 


Thus for all X G K, 






7=1 


< 


'y ^j \Jq,niXij^(x) {dL ), {dL) ) / q\\- 


5=0 


^n,m 


T,n,(x,: 


m - 

< A„(+oo) 2 -^^ — 


r, ^n,m 

5=0 


7=1 


By definition of K 


2 ^e 


> 


n 


{m-m* + 2)A„(+oo) dn,m' 
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Therefore we get by Markov’s inequality for q = m*,,m 


P 1 A.„(+C)o)2 


-K 




t=l 


> e/(m — m* + 2) < P 


i=i 


> n 


< c% 

j2-qD 

/ 1 \ 2—m*D 


X-m*D 


For q = 0 the term is deterministic, thus the probability is 0. 


□ 


Proof of Theorem O The two conditions of Lemma rd.ll are satisfied (see Lemma r4.2l and Lemma 
with 


K = 


log., I 


+ 1 . 


Note that {K + 3)® < Ce for any <5 > 0, see Dehling and Taaou (1989a), page 1781. By 
this fact and by virtue of Lemma 14.11 


P ^sup|5n(/; x)\ > < C '^e ^ + C 


2-m*D 


X-m*D 


n 


< Cn-P U-)e-^+(- 

n) \n 


2-m*D' 


with p = min (7 — 6,m*D — A). Now choose <5 < 7 , then p > 0 and we have thus proven a 
reduction principle in x. It remains to verify uniformity in 1. For n = 2^ one gets by the same 
arguments as in the proof of Theorem 3.1 in lDehling and Taaod (|l989al l 


P ( maxsup|S'„(/;x)| > e ] < Cn '‘(1 + e 
V X J 


for any 0 < e < 1 and universal constants C and e. Next consider arbitrary n and dehne for r 
such that 2 ^“^ < n < 2 *' 


X](^{Gn(Xp<x} - Lm,n,j{x)) for I < 2 ^ 




2^,?n • T 

’ 7=1 
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where {Gn{Xj)}n£-Mj< 2 ^ is a (slightly modified) array. One obtains 


Hence 


P 


( maxsup|5*(/;x)| > e 
V X 




Kn 


P ( max sup I 3:^)1 > € I < P ( maxsup|S'*(Z; x)| > e 


dn 


Kn 


< Cn-'‘(l + e-3). 


d2^,r 

—3 / 

dn.m. 


The last line holds since d 2 '’,m/dn,m is uniformly bounded away from 0 and oo. Thus, Theorem 
[3] is proven. □ 

Proof of Corollary \2fJ^ Using the reduction principle, namely Theorem [3l it remains to show 
that 

q=m* i=l 

converges to the desired limit processes. Define 


lnt\ 


Zn,q{t) = 


'n,q 


i=l 


and note that because of 1/m > D the sequen ces {ff^(Xi)|,;>i are l ong-range dependent for 
q = m*,..., m. Then we have by Theorem 4 of lBai and Taaaul (|2013l l 


V 


{Zn,m* ! • • • ) Zn^m) ^ {Zm* ) • • • j -^m) j 


(4.10) 


where convergence takes place in (I1[0,1])™'“™'*"’“^, equipped with the uniform metric. More¬ 
over, (■^g(t))te[o,i] are uncorrelated Hermite processes of order q. The functions hq are elements 
of D[—oo,oo] and therefore they are also bounded, see Remark 12.151 Hence we may apply the 
continuous mapping theorem and conclude that 

m [nt\ 

hq{x)d-^qY^q(Xi) 
q=m* i=l 
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converges in distribution to 


S £ hq{x)Zq{t) i , 

19 =”^* J t,x 

where convergence takes place in Z)([0,1] x [— 00 , 00 ], equipped with the supremum norm. 
The result then follows by the uniform convergence of dn,m/dn,qJq,n{x) towards q\hq{x), the 
reduction principle and Slutsky’s theorem. □ 


4.2 Proof of Theorem [T] and Theorem [2] 

We start by proving a reduction principle for the empirical process in presence of a change 
point. Consider the array {Tn,i}neiN,i<n) dehned in section [2l and let Hn^i{x) = P{Yn,i < x). 
Dehne 


\nt\ 


hYr,,i<x} - Hn,iix) - ^ 




■ 

’ 1 — 


2=1 


q=m^ 


q\ 


Hq{X, 


where Jq,n,i{x) = E[l[Y„,i<x}Hqi^i)]- Note that Jq,n,i{x) = 0 if i < [nrj and q < m. 

Lemma 4.4. Let the conditions of Theorem{I\hold. Then there are constants C > 0 and k > 0 
such that for all e > 0 


Proof. Define 


and Sn, 2 {t,x) = 


By Theorem [3] we have 


pi sup sup|Si’")(t,x)| > e <Cn-^il + €-^). 
\,te[0,l]a;eR J 


n = ^T,( - F(x) - 

^n.m . 1 V 

j=i 

^ / m 

^ I ^{Gn{Xj)<x} — ^ 


dn 


'^q,n{x) jj ^ 

, Pq\Xjj 


q=m* 




P [ sup sup|S'n,i(t,x)| > e J < Cn '’’(1 + e f = 1,2. 
Ue[0,l]a;eR ’ / 


(4.11) 
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Next obtain 


Sn,i{t,x), if t < r, 

Sn,2(t, x) + Sn,liT, x) - Sn,2{T, x), if t > T. 


SV{t,x) = 


Therefore, we get, using (I4.1ip several times, 


P 


sup sup 1(t, x) I > 
tG[0,l] xGR 


< 2P 


sup sup|S'„,i(t,x)| > e/4 

ye[0,i]3;GR 


+ 2P 


sup sup|S'n, 2 (f,x)| > e/4 
\te[o,i] a;€R 


< 4Cn-'‘(l + e-3). 


for all n G IN and all e > 0. 


Lemma 4.5. Let Assumption A hold. Then for all q < m 


sup dn^m* / dn,m\Jq,n{x^ e/q(x)| y 0, 

as n ^ oo. 

Proof. Using Holder’s inequality, one has for any p € IN 


□ 


(4.12) 


\Jg,n{x) </j(x)| — \E (^(l^Q^i^Xi)<x} ^{G{Xi)<x})Hq{Xi)^\ 

< C{E\l[Gn(Xi)<x} - ^{G(Xi)<x}\Y^'"'^^^'‘ 

Now obtain 


EY{Gr,{Xi)<x} - ^{G{Xi)<x}\ 

= P({G„(Xi) <x,G(Xi) >x}U{G„(Xi) >x,G(Xi) <x}) 

= l-P {{GniXi) < X, G{Xi) <x})-P {{Gn{Xi) > X, G(Xi) > x}) 
= P(min{Gn(Xi),G(Xi)} < x) - P(max{Gn(Xi), G(Xi)} < x) 

= o{nY^* , 
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for some <5 > 0. The last line holds uniformly due to Assumptions A2. Finally, 


^ C - 1{G(AX)<X}|) 


Choosing p > 1/(5, this implies (I4.12p . 

Proof of Theorem [7J By definition of the functions Jq,n,i we get 

l^d m 


1 


d, 


EE 




n.m ■ 1 Q 

’ 2=1 q=m* 

1 ^ u iy 


^n.m — . 1 

’ 2=1 

m —1 

+ l{i>r} X] 


d-n^q Jq,n{^') 1 


=^* dn^q 


q=m 


\nt\ 

E 

2= [nrj +1 
[ntj 


Jm,n{x) Jm{x) 1 tlt tv\ 

+ Ht>r} --p-3- 2^ 

^n.m. . . , . 


n,m . I I , . 
2= [nr J + l 


□ 


The second and the third summands are negligible due to the uniform convergence of the 
functions Jq^n (see Lemma 14.51) 1. The first summand converges in distribution towards 


Jm{x) 

ml 


Zm{t), 


see 


Dehling and Taaoul (|l989al L Together with Lemma 14.41 this finishes the proof. 


□ 


Proof of Theorem [M We give the proof for a sequence of local alternatives. The asymptotic 
behavior under the hypothesis then is an immediate consequence. Obtain the following de- 
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composition of the empirical bridge-process 


where 


, / 

-El 

yi=i 

- E(i 

n;m ^ 


[nt\ ^ 


2=1 


^^n,i(a:)j - ty~] ( 1 




2=1 


-\- { t — 


Tl 

+ -Vln,T(i) (^(a;) - J^(n)(x)) , 


\nt\ \ 1 


n / d 




2=1 


^n,m 



'071,T (0 



if t < r, 
if t > r. 


(4.13) 


By uniform convergence of n/dn,m{F{x) — F(„)(x)) and '4’n,T(t) towards ( 7 (x) and 0r(0, respec¬ 
tively, Theorem [1] and the continuous mapping theorem, one gets that (14.131) converges weakly 
towards 


Jm{x)/(ml) {Zm{t) - tZm{t)) 'll:r{t)g{x). 


The convergence of the Kolmogorov-Smirnov type statistic then follows from continuity of 
the application of the supremum norm. The Cramer-von Mises statistic Sn can be written 
Sn = supig[o,i] Mn{t), where 
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= d-^ 

'^n,m 


n / 

/ kyn,i<x} 

Vi=i 


[nt\ 


n 


Y kyr.,i<x} dF{x) 


(4.14) 


2=1 


r I ^1 \ . 

+ dn% / Z] ^{yr.,^<x} -^Y kyn.i<x} d{Fn{x) - F{x)). (4.15) 

Vi=i ^ i=i / 


Due to the convergence of (|4.13l) and the continuous mapping theorem, (j4.14p converges to the 
desired limit process. Thus, it remains to show that (|4.15p is negligible. Therefore, obtain 


d-2 

^n,m 


'K 


/ 

\i=i 


-^Y kyr^,i<x} I d{Fnix) - Fix)) 


2=1 


Jmix)/{m\)Zmit) - iprikgix)) d(Fn(a^) “ Fix)) 


(4.16) 




[nt\ 

{Y„^i<x} ^ 2^ ^{y7i,i<x} 


2 = 1 


-[jraix)/im\)Z^it)-'iprit)gix)] > d(F„(x) - F(x)). (4.17) 


Using the Skorohod-Dudley-Wichura representation theorem (whose conditions are satisfied 
because Jmix)/im\)Z^it) — 'ipTit)gix) lays almost surely in (^([0,1] x [— 00 , 00 ])), one can 
assume without loss of generality that 


[ntj 


dn'fm I Y kyn,i<x} - Y kyn,i<x} 1 - ( Jmix)/iml)Zmit) - Ai^dix) 


2=1 


2=1 


converges almost surely to 0, uniformly in x and t. Thus, (14.1711 converges to 0, uniformly in 
t. Next consider (|4.16p 


Jmix)Iim\)Zmit) - 'tprit)gix) d(i4(x) - T(x)) 


/R 


= iZmit))kim\)^ [ Jlix) diFnix) - Fix)) 

Jn 

- 2Zmit)-iprit)/im\) [ Jmix)gix) diPnix) - Fix)) 
4R 

+ V’?(^) [ g^{x) diPnix) - Fix)) 

4r 

— In I In F Illn- 


As a consequence of Theorem [T] and F^^^^ix) —>■ Fix) one gets a weak Glivenko-Cantelli type 
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convergence, in detail 


sup|F„(x) — F{x)\ < sup 


^n{x) - 


Hn,i (x) 


i=l 


n 


n—Inrl ,, ^,,,p 

H-^ sup F(„)(x) - F(x) 0. 

n xgR 


Moreov er, obtain that Jm{x) is of bounded variation (this was also noted in iDehling and Taaou 
(jl989al il. To see this, let [a, 6] be an arbitrary interval and a partition of this interval. 

Then 


n—1 


n—1 


J(xi+i) J{xi)\ — 'Y^\F[^\xi<G{Xx)<Xi+i}Hm{Xi)\\ 


i=0 


i=0 

n—1 


< 


Y ^i^{xi<G{Xi)<Xi+i}\HmiXi)\] 


i=0 


= E 


'n-l 


Y, ^{xi<G{Xi)<Xi+-i}\Hm{Xl)\ 
. 1=0 

= E [l{G{Xi)€[a,b]}\Em{Xl)\] 

< E\Hm{Xi)\. 


By the boundedness of Jm, Jm is also of bounded variation and thus integration by parts, 
together with the weak Glivenko-Cantelli-type reulst, yields 

In = [ {Fn{x) - F{x)) djl{x) A 0. 

By definition, the function g{x) is bounded and of bounded variation. Hence the same is true 
for g^{x) and by the same arguments as above one gets 11 In = op{l). Finally, I In = op(l)) 
which can be seen using Holders’s inequality. This finishes the proof. □ 

Remark 4.6. Note that our proof of the weak convergence of the Cramer-von Mises statistic 
would not work for short-range dependent time series. The reason is the completely different 
limit behavior of the sequential empirical process. Instead of the semi-degenerate process 
Jm{x)Zni{t) one gets a Gaussian process K{t,x). While Jm is of bounded variation, this is 
not the case for sample paths of K. Hence J-^K{t,x) d{Fn{x) — F{x)) cannot be treated 
simultaneously to A d{Fn{x) — F{x)). 
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